Improve reference files: graduated challenge, story freshness, calibration proxy, transcript formats by dbhat93 · Pull Request #32 · noamseg/interview-coach-skill

dbhat93 · 2026-03-02T15:30:06Z

Summary

Improvements to 6 reference files and the decode + feedback commands, built while deeply using the skill. These were developed collaboratively and pressure-tested before submission.

What's included

references/challenge-protocol.md (new file)

Graduated activation levels (L3=Assumption Audit only, L4=+Blind Spot, L5=all 5 lenses) — challenge begins earlier and becomes familiar rather than jarring
Challenge Debt: tracking mechanism for unresolved challenges across sessions (records, surfaces at next relevant session, expires after 3 sessions)
6 command-specific invocations with level-specific behavior

references/calibration-engine.md (new file)

Practice Calibration Proxy (Section 8): plateau detection without real interviews — triggers when 5+ practices show ≤0.3 score variance over 3 sessions. New status: practice-calibrating
Score Confidence Display (Section 9): Low/Medium/High confidence table with when-to-surface guidance

references/story-mapping-engine.md (new file)

Framing Variety Constraint with 3-criteria Substantiation Test replacing the "no story twice" rule — a strong story told twice with different angles beats forcing a weaker story into one slot
Story Freshness Score (Section 5): 3-factor decay model (use frequency risk, experience age, Proven Performer offset) with 4 verdicts (Fresh / Fresh(Proven) / Moderate / Stale)
Freshness column added to output schema mapping matrix

references/transcript-formats.md (new file)

Added Fireflies.ai as a 9th supported format (was missing)
Zoom Webinar distinction (panelist vs. attendee label conventions)
Granola quality signal: cross-check AI summary against raw transcript to detect truncation or AI hallucination
Format Fingerprint: 5-signal fallback for ambiguous cases before defaulting to Manual

references/commands/decode.md (new file)

Pathway dimension (Referral/Warm intro/LinkedIn/Cold/Internal) added to batch triage with explicit tier-boost logic (+1 tier for Referral/Warm, +0.5 for Internal)
JD Honest Limitations surfaced as named section in every Standard/Deep output — not buried

references/commands/feedback.md (new file)

Memory reliability tagging for Type D (Same-day=high, 1–2 days=medium, 3+ days=low)
Guided elicitation prompts for thin Type D memories (4 specific questions to surface what's actually there)
Rejection Leverage defers to graduated challenge levels instead of hardcoding L5

Test plan

Review each reference file for internal consistency
Verify command-specific invocations in challenge-protocol match the invocation points in stories, analyze, practice, progress, hype, feedback
Check that transcript-formats disambiguation rules cover all 9 formats
Confirm story-mapping-engine integration points reference correct command files

🤖 Generated with Claude Code

…ation proxy, transcript formats challenge-protocol.md: - Graduated activation (Level 3 = Assumption Audit only, Level 4 = + Blind Spot, Level 5 = all 5 lenses) instead of binary Level 5 only - Challenge Debt: unresolved challenges tracked across sessions and surfaced at the start of the next relevant session instead of silently dropped - Avoidance Confrontation now has distinct Level 3/4 behaviors, not just a single Level 5 mode story-mapping-engine.md: - Framing Variety Constraint replaces "no story twice" rule: a story may appear twice in a mapping if the angles are substantively different, tested via a 3-criteria Substantiation Test (different competency, different STAR section, different interviewer takeaway) - Story Freshness Score: 3-factor decay function (use frequency risk, experience age risk, Proven Performer offset) replacing raw use-count overuse check. Verdict: Fresh / Fresh (Proven) / Moderate / Stale calibration-engine.md: - Practice Calibration Proxy (Section 8): plateau detection without real interview outcomes — when same answer type practiced 5+ times and scores vary ≤0.3 across 3 consecutive sessions, flag as practice ceiling and prescribe behavioral change. New status: practice-calibrating. - Score Confidence Display (Section 9): scores carry calibration status context (Low/Medium/High confidence) so candidates don't treat estimates as ground truth - De-escalation Check: root causes are explicitly resolved when affected dimensions improve, preventing over-diagnosis drift - Explicit transition rule: practice-calibrating → calibrating when 3+ real outcomes arrive transcript-formats.md: - Added Fireflies.ai as 9th supported format with disambiguation rules separating it from Grain - Zoom Webinar distinction: Panelist: prefix handling - Format Fingerprint for ambiguous cases: 5-signal fingerprint + medium- confidence detection decision, gives candidate basis to override - Granola quality signal: AI summary cross-checked against raw transcript, flags possible truncation or hallucination decode.md: - Pathway-weighted batch triage: Referral / Warm intro / LinkedIn / Cold / Internal dimension added to ranking. A Long-Shot Stretch with a Referral ranks above an Investable Stretch cold application. - JD Honest Limitations surfaced in every Standard/Deep output as a named section (not just reference knowledge). Quick Scans include a one-line reminder. Candidates never walk away treating a decode as a complete picture. feedback.md: - Memory reliability tagging for Type D (Post-Session Memory): Same-day / 1-2 days / 3+ days tags applied to Question Bank entries - Guided elicitation prompts for thin memories: surfaces what's actually there rather than accepting vague recall as complete capture - Rejection Leverage defers to graduated Challenge Protocol levels instead of hardcoding Level 5 behavior Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

dbhat93 · 2026-04-01T19:13:01Z

Closing this — the changes here have been superseded by PR #35, which consolidates all v3.1-v3.3 work into a single PR. Thanks!

dbhat93 closed this Apr 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve reference files: graduated challenge, story freshness, calibration proxy, transcript formats#32

Improve reference files: graduated challenge, story freshness, calibration proxy, transcript formats#32
dbhat93 wants to merge 1 commit intonoamseg:mainfrom
dbhat93:contribute-upstream

dbhat93 commented Mar 2, 2026

Uh oh!

dbhat93 commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dbhat93 commented Mar 2, 2026

Summary

What's included

Test plan

Uh oh!

dbhat93 commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants